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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/7 0 3 , 8 09 A 



DATE: 06/21/2002 
TIME: 11:25:18 



Input Set : A:\119941-1083 Seq ID Listing.ST2 5.txt 
Output Set: N:\CRF3\06212002\l703809A.raw 



3 
5 
7 
9 
10 
12 
13 
15 
17 
19 
20 
21 
2 2 
24 
25 

2 7 
29 
31 
33 

3 5 
37 

3 9 
41 
43 

4 5 
4 7 
4 9 
M 



•:iicr 
■:120- 
•:130 
•;i4u 
■:14l 
•;150 
•:15l 
•:160 
•:170 
•:210 
-:21l 
<212 < 
-;213t 
< 4 0 0 ' 



APPLICANT: DeJong, Jeff L. 

TITLE OF INVENTION: Transcription Factors Related to TFIIA 

FILE REFERENCE: 119941-1083 

CUKKENT APPLICATION NUMBER: 09/703 , 809A 

CURRENT FILING DATE: 2000-10-30 

PRIOR APPLICATION NUMBER: 09/326,529 

PRIOR FILING DATE: 1999-06-04 



6 7 



NUMBER OF SEQ ID NOS : 18 T 

SOFTWARE: Patentln version 3.1 I 
SEQ ID NO: 1 
LENGTH : 1617 
TYPE: DNA 

ORGANISM: Homo sapien 
SEQUENCE : 1 

gctggaggtg ctgtcatggc ctgcctcaac ccggtgccta aactctacag 

gaagatgtaa ttgaaggagt tcggaatcta tttg^tgaag aaggtataga 

ttaaaagact tgaagcagct ctgggaaacc aaggttttgc agtctaaagc 

ttcttcagaa atagcatcca atcacctctg tttactcttc agttgccgca 

caaacattgc aatcgtcaac agcatcatta gttattcctg ctggtagaac 

tttaccacag cagaactggg cacttcaaac tccagtgcaa actttacttt 

cccattcatg taccagcagg tgtgacacta cagactgtat ctggtcacct 

aatgtaccaa ttatggtgac agagacttct ggaagagcag gtattcttca 

cagcaagtat ttcaacagct tggccagcct tcagtaatac aaactagtgt 

aatccatggt ctcttcaagc aactactgaa aaatcacaga gaattgaaac 

caacccgcaa ttctaccttc tgggccagta gataggaaac acttagaaaa 

gatatacttg tatctcctgg aaatgagcat. aaaarcgtg: ctgaagcttt 

oaggaaagtt ctcactataL cjgt.ot.tcca ggtgttgtat. 1 1. 1 r; too a oa 

a:aa.ittctg atgtggagtc agt.gctcagt ggtt:agcta g 

gatgagtccc tctccaoaag ccctcatggg gctctorac: a 

of. to.* ■: att z t.taaaaatag gatgtatgga tgtgatt^tg t. 

gaggaaccca gcaacatacc tgtatcagag aaggatc :ta a 

at.tcgggt.ta ctgatgatga tattggtgaa ataat.tcaag t 

totto^aatg aagaaatagg aagtaoaaga gatgcagatg a 

attgargggg gagat.ct.gaa ggtaoctgaa gaagaagotg a 

tcagoracaa acagtagtga taatgaagac cctcaagtaa a 

ttaaattctg gagatgatgt tagtgaacag gatgtgcoag a 



catggctca 
g cacgtga z 
aaag zaac-z 
1 1 zt ::aggt 
agatggaag 
jaatgaatr 
ragtatt to 
zat t at a ga 
rctgtttg a 



atctgtaatt 
ggaacaagtt 
aacagaagao 
oagcttgo 
tottocaagt 
tortggttat 
t ta taaagt o 
goat ooaatt 
t.coacaattg 
eg tgctacag 
t gooaocagt 
gr - g-gt oat 
gg* o- ct oaa 
aaat otgoat" 
t ga t. att cag 
aagaaat .at a 
ggatt t.aagc 
:ggtgataca 
t ct.agggaat 
a a atgagga t. 
agaggacoot 
:acggataa t 

i t t r-+- a t t- f rr 



D 



60 
120 
180 
24 0 
3 'Hi 

3 6 0 

4 20 
4 HO 
54 0 



l < ) o 
1:1 4 0 

12-" 0 
1 2 - 0 

1 32 0 



2 .1 1 > LENGTH : 4 76 
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RAW SEQUENCE LISTING DATE : 06/21/2002 

PATENT APPLICATION: US/0 9/7 0 3 , 8 0 9A TIME: 11:25:18 

Input Set : A:\119941-1083 Seq ID Listing.ST25.txt 
Output Set: N:\CRF3\06212002\I703809A.raw 

82 <212> TYPE: PRT 

83 <213> ORGANISM: Homo sapien 
85 <400> SEQUENCE: 2 

87 Met Ala Cys Leu Asn Pro Val Pro Lys Leu Tyr Arg Ser Val lie Glu 

88 1 5 10 15 

91 Asp Val lie Glu Gly Val Arg Asn Leu Phe Ala Glu Glu Gly lie Glu 

92 20 25 30 

95 Glu Gin Val Leu Lys Asp Leu Lys Gin Leu Trp Glu Thr Lys Val Leu 
9b 3 5 4 0 4 5 

99 Gin Ser Lys Ala Thr Glu Asp Phe Phe Arg Asn Ser lie Gin Ser Pro 

100 50 55 60 

103 Leu Phe Thr Leu Gin Leu Pro His Ser Leu His Gin Thr Leu Gin Ser 
1U4 tb 70 75 80 

107 Ser Thr Ala Ser Leu Val He Pro Ala Gly Arg Thr Leu Pro Ser Phe 

108 85 90 95 

111 Thr Thr Ala Glu Leu Gly Thr Ser Asn Ser Ser Ala Asn Phe Thr Phe 

112 100 105 110 

115 Pro Gly Tyr Pro He His Val Pro Ala Gly Val Thr Leu Gin Thr Val 

116 115 120 125 

119 Ser Gly His Leu Tyr Lys Val Asn Val Pro He Met Val Thr Glu Thr 

120 130 135 140 

123 Ser Gly Arg Ala Gly He Leu Gin His Pro He Gin Gin Val Phe Gin 

124 145 150 155 160 

127 Gin Leu Gly Gin Pro Ser Val He Gin Thr Ser Val Pro Gin Leu Asn 

128 165 170 175 

131 Pro Trp Ser Leu Gin Ala Thr Thr Glu Lys Ser Gin Arg lie Glu Thr 

132 180 185 190 

135 Val Leu Gin Gin Pro Ala lie Leu Pro Ser Gly Pro Val Asp Arg Lys 

136 195 200 205 

139 His Leu Glu Asn Ala Thr Ser Asp He Leu Val Ser Pro Gly Asn Glu 

140 210 215 220 

14 3 His Lys He Val Pro Glu Ala Leu Leu Cys His Gin Glu Ser Ser His 
144 225 230 235 240 

14/ Tyr lie Ser Leu Pro Gly Val Val Phe Ser Pro Gin Val Ser Gin Thr 

14 3 24 5 2 50 2 55 

1M Asn Ser Asp Val Glu Ser Val Leu Scr Gly Se.r Ala : >er Met Ala Gin 

15, 250 26 5 270 

155 Asn Leu His Asp Glu Ser Leu Ser Thr Ser Pro His Gly Ala Leu His 

1:-: 2 75 2 80 23 5 

159 Gin His Val Thr Asp lie Gin Leu His lie Leu Lys Asn Arg Met. Tyr 

160 290 2 95 300 

163 Gly Cys Asp Ser Val Lys Gin Pro Arg Asn lie Glu Glu Pro Ser Asn 

164 305 310 315 320 
167 lie Pro Val Ser Glu Lys Asp Ser Asn Ser Gin Val Asp Leu Ser He 
1 1 8 3 2 5 3 3 0 3 3 5 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/70 3 , 809A 



DATE : C 6/21/2002 
TIME: 11:25:18 



Input Set : A:\119941-1083 Seq ID Listing.ST2 5.txt 
Output Set: N:\CRF3\06212002\l703809A.raw 



179 
180 
183 
184 
187 
188 
191 
192 
195 
196 
199 
200 
203 
204 
207 
208 
209 
210 
212 
213 
215 
217 
219 
221 
2 2 3 
225 
227 
229 
231 
233 
2 35 
237 
2 3 9 
2 .; i 



2 .; 9 

j :-i 

2 5 3 

•' r c 

2 57 
2 l *9 

261 
263 



Glu Asn Glu Phe Leu Gly Asn lie Asp 

370 375 
Glu Glu Glu Ala Asp Ser lie Ser Asn 
385 390 
Ser Asp Asn Glu Asp Pro Gin Val Asn 
405 

Asn Ser Gly Asp Asp Val Ser Glu Gin 
420 425 
Thr Asp Asn Val lie Val Cys Gin Tyr 

435 440 
Asn Lys Trp Lys Phe Tyr Leu Lys Asp 

450 455 
Arg Asp Tyr Val Phe Ala Lys Ala lie 
465 470 
<210> SEQ ID NO: 3 
<211> LENGTH: 3824 
<212 > TYPE : DNA 
<213> ORGANISM: Homo sapien 
<400> SEQUENCE: 3 

ggactttggg actggacaga cctggtcaca 
actttagagt caacctattt gatttcttga 
tccacaaatc caggcaaatg ggtcaccttt 
aagtcaaaga attttcctct ggagaatcaa 
aaccctcctg gcctcaggga atttcccagt 
tcctccccca ttgtagattt ttatttcagt 
acacctacca aagacttccc aggttttcct 
tatcetattc cagaatcatc ttcagacagc 
ttactgccta ccagaccaac atgtttatcc 
acacatccaa ctcccaaagt aggtcttcca 
ctaggattcc aaagtgatga tctcccccag 
tcaagtccat ttcggaaaga tgaaggeagt 
agcaaaaaga tgttctcatc aagaaacaag 
aagtgttcac tcaactatat ctgtgagaag 
gaeteaetta gaagtttgtc tatgcactgt 
gto:-:ccaca cactcttcag gagtcag:ca 
eetgagaaga agaatatgat gtct.t reey g 
eetggaggaa ttttgcagat gtattat.gaa 
eagettgate catattgtag gctttctgaa 
aaaar.ccacd ctgtgaagat tgaaeatgtg 
aagaeagaag tagttcatga acctgacata 
taccatgact tccttgaett tctgaotact 
gttteaaaac caaaaaagaa ctacgaggag 
ttttggggta aagtcacaaa agaaggaaaa 
tattiTcctct gctttgtgaa tgggaaeet.g 
ttgcegaagc gagatgaatc ctattatgag 



Gly 

Glu 

lie 
410 
Asp 

Asp 

Gly 



Gly 

Asp 
395 
Val 

Val 

Lys 

Val 

475 



Asp Leu Lys Val Pro 
380 

Ser Ala Thr Asn Ser 

400 

Glu Glu Asp Pro Leu 
415 

Pro Asp Leu Phe Asp 
430 

lie His Arg Ser Lys 
44 5 

Met Cys Phe Gly Gly 
460 

A 1 H Glu Trn 



gtctaggttc 
caagaccaca 
gatgatgatc 
ggtgtctgta 
ggatcttcct 
ccaggacctc 
ggca tcccca 
ccactcgcaa 
eatgeettg t 
gatgaagtta 
tttcagtatt 
gattcccatt 
gaga tgccta 
cttgaacatc 
eta tgtgctg 
aaat c rgga t. 
eaa t gggga z 
:a ggga t. tag 
:ccaaggttg 
r ct t aeaeag 
gagcagatgc 
gtggaggagg 
caagaaattt 
t ttgttgaaa 
gaa t get 1 1 1 
aaggact. cag 



ta catcttac 
a tctgatccc 
ctgetgttca 
ga eeaaa tgg 
ccac cagcag 
caagtaactc 
aageagggae 
tatcaggagg 
ta c ccagtga 
ate eteaaca 
ttegagagga 
teaecet tga 
t tgaceaaaa 
tccagtcagc 
a a gaa a atge 
ggr. rtt t rat 
ea a 1. 1 1 1 let. 
aaaa.a eea t t 
a gaa r t t rag 
aaaaa a ggaa 
tgaagttggg 
a g e t g a t g a a 
ce ttggaaat 
gtgetgtgat 
t a a eet tga a 
aaaaaaaggg 



tggtcgagca 
aaagatgtgc 
atet t rtcaa 
ae tgaage tg 
caetcete tc 
teetctttet 
tcatgtgc tt 
agaa tettec 
ccae teatgt 
gge tgaaagc 
etgtgctttt 
eeea eeagga 
aag e e taaat 
tgagaaceaa 
etetteettt 
g e t g agaa 1. 1 
g a =j a g t" 1 1 r a 
t laa gaga • a 
t t rag ragga 
ataerat t rt 
gteeaeat eg 
gttgeeaget 
tgtggaeaae 
a a e t e a a a 1 1 
t g a e r 1 1 g a g 
g a 1 1. g a t a 1. 1 



60 
120 
180 
24 0 

3 0 0 
3*0 

4 2 0 
4 8 0 
54 0 
6 0 0 
h 6 0 
/ 2 0 
1 8 0 
84 0 

'~\ <') (": 



i : ■ o 
i o i;. 

13 20 

14 4 0 

1 5 "» 0 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/703 , 809A 



DATE: 06/21/2002 
TIME: 11:25:18 



Input Set : A:\119941-1083 Seq ID Listing.ST2 5.txt 
Output Set: N:\CRF3\06212 0 02\l7 0 3 8 09A.raw 



273 
275 
277 
279 
281 
283 
285 
287 
289 
291 
293 
295 

2 97 
299 
301 
303 
305 
307 
309 
311 
313 
315 
317 
319 
321 
323 
325 
327 
329 
331 
333 
335 
337 
339 

;> 4 2 

-1 3 
34 4 
34 5 
34 7 
34 9 
350 
353 

3 54 
357 
358 



aggtcctgtg acaatataag gatacacttt cctgtcccat 
tggaccatga acctccagag gcagaagtct ctgaaagcta 
ctggggagtt tacaggaact tgaatctgaa cctgtcattc 
aaatatgaga gtgcctacca ggcagtggta tggaagatag 
tcaagtctag atcatcccca ttgtctgtca tacaaattag 
attccctctg attggtatcc atttgctact gttcagtttt 
tcaaggacag aggtcaggtc tctgggagtg gagagtgatg 
cagcagcgag cttgctacaa catccagcct aaactctaca 
attgaaggag ttcggaatct atttgctgaa gaaggtatag 
ttgaagcagc tctgggaaac caaggttttg cagtctaaag 
aatagcatcc aatcacctct gtttactctt cagttgccgc 
caatcgtcaa cagcatcatt agttattcct gctggtagaa 
gcagaactgg grarttnaaa ctccagtgca aactttactt 
gtaccagcag gtgtgacact acagactgta tctggtcacc 
attatggtga cagagacttc tggaagagca ggtattcttc 
tttcaacagc ttggccagcc ttcagtaata caaactagtg 
tctcttcaag caactactga aaaatcacag agaattgaaa 
attctacctt ctgggccagt agataggaaa cacttagaaa 
gtatctcctg gaaatgagca taaaatcgtg cctgaagctt 
tctcactata tcagtcttcc aggtgttgta ttttctccac 
gatgtggagt cagtgctcag tggttcagct agcatggctc 
ctctccacaa gccctcatgg ggctctccac cagcacgtga 
cttaaaaata ggatgtatgg atgtgattct gtaaagcaac 
agcaacatac ctgtatcaga gaaggattct aattctcagg 
actgatgatg atattggtga aataattcaa gtagatggaa 
gaagaaatag gaagtacaag agatgcagat gagaatgaat 
ggagatctga aggtacctga agaagaagct gacagtattt 
aacagtagtg ataatgaaga ccctcaagta aacattgtag 
ggagatgatg ttagtgaaca ggatgtgcca gacctgtttg 
tgtcagtatg ataagattca tcgaagcaag aacaaatgga 
gttatgtgtt ttggagggag agactatgta tttgcaaaag 
taaaccttgt gagctcagta catctatttt gtgaacatca 
gtgaattcat ttttattttg aatatagtcc agcacagagc 
tgtatggaat ttaataaaat tataattcag atgcagatac 

210 > SEQ ID NO: 4 

2 1 1 > LENGTH: 1181 
212 > TYPE: PRT 

213 > ORGANISM: Homo sap i en 
4 0 0> SEQUENCE: 4 
Met Cys Ser Thr Asn Pro Gly Lys Trp Val Thr 



cgcagtggat 
aaatgaaccg 
aagtcactgt 
atcggcttcc 
agct tggatc 
ccgtgcctga 
tccagccaca 
gatctgtaat 
aggaacaagt 
caacagaaga 
acagcttgca 
ctctt ccaag 
tt cctggtta 
tttataaagt 
agcatccaat 
ttccacaatt 
ccgtgc taca 
atgccaccag 
tgttgtgtca 
aggtctctca 
aaaatctgca 
ctgata ttca 
caagaaatat 
tggatt taag 
gcggtgatac 
ttctagggaa 
caaatgagga 
aagaggaccc 
acacggataa 
aattctattt 
cca ttggtga 
gttggactat 
tgttcaaatt 
aatt 



caaggccctt 
ccgagcatgt 
ggggtcagca 
a g a c a a a a a t 
agaccaagaa 
cacctgtgc c 
gaaaca tgtt 
tgaaga tgta 
tttaaaaga c 
c ttcttcaga 
ccaaacattg 
ttttaccaca 
t cccat tea t 
caatgtacca 
tcagcaagta 
gaatccatgg 
gcaacccgca 
tgatatactt 
tcaggaaagt 
aacaaattct 
tgatgagtc c 
gc ttcatatt 
agaggaacc c 
ca ttcgggtt 
atctt coaat 
tattgaeggg 
ttcagccaca 
tttaaattct 
tgtaattgtc 
gaaagatggr 
tgcagagtgg 
attgeatatt 
tttagttcac 



1 



10 



Ala Val Gin Ser Ser Gin Lys Ser Lys Asn Phe 



20 



25 



Gly Val Cys Arg Pro Asn Gly Leu Lys Leu Asn 



35 



40 



Phe Asp Asp Asp Pro 

1 5 

Pro Leu Glu Asn Gin 

30 

Pro Pro Gly Leu Arg 
45 

^ >- p >- r o , i q ^ c p >- 



186>) 
19 2 0 
19M - 
2 04'' 
2100 
2 1 0 ( i 
2 2 2 0 
2 28'.' 
2 34 0 
2 4 'M 
.14 6 0 
2 52 0 
2 58 0 
2 64 0 
2 7 1.) 0 
2 7 6 u 
2 82 0 
2 8 80 

2 9 4 0 

3 0 0 0 
3 0 6 0 
312 0 
318 0 
js 2 4 0 
3 3 0 0 
3 3 0 0 
3 4 2 0 
•'■4 8'"' 
3 54 0 
3 6 0 0 
3 6 6 0 
3 72 0 
3780 
3824 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/7 0 3 , 8 09A 



DATE: 06/21/2002 
TIME: 11:25:18 



Input Set : A:\119941-108 3 Seq ID Listing.ST2 5.txt 
Output Set: N:\CRF3\06212 0 02\I7 0 3 8 09A.raw 



369 Leu Ser Thr Pro Thr Lys Asp Phe Pro Gly Phe Pro Gly lie Pro Lys 

370 85 90 95 

373 Ala Gly Thr His Val Leu Tyr Pro lie Pro Glu Ser Ser Ser Asp Sor 

374 100 105 110 

377 Pro Leu Ala lie Ser Gly Gly Glu Ser Ser Leu Leu Pro Thr Arg Pro 

378 115 120 125 

381 Thr Cys Leu Ser His Ala Leu Leu Pro Ser Asp His Ser Cys Thr His 

382 130 135 140 

385 Pro Thr Pro Lys Val Gly Leu Pro Asp Glu Val Asn Pro Gin Gin Ala 

386 145 150 155 100 

389 Glu Ser Leu Gly Phe Gin Ser Asp Asp Leu Pro Gin Phe Gin Tyr Phe 

390 165 170 175 

39 3 Arg Glu Asp Cys Ala Phe ^er Ser Pro Phe Arg Lys Asp Glu Gly Ser 
394 180 185 190 

397 Asp Ser His Phe Thr Leu Asp Pro Pro Gly Ser Lys Lys Met Phe Ser 

398 195 200 205 

401 Ser Arg Asn Lys Glu Met Pro lie Asp Gin Lys Ser Leu Asn Lys Cys 

402 210 215 220 

405 Ser Leu Asn Tyr lie Cys Glu Lys Leu Glu His Leu Gin Ser Ala Glu 
406225 230 235 24 0 

409 Asn Gin Asp Ser Leu Arg Ser Leu Ser Met His Cys Leu Cys Ala Glu 

410 245 250 255 

413 Glu Asn Ala Ser Ser Phe Val Pro His Thr Leu Phe Arg Ser Gin Pro 

414 260 265 270 

417 Lys Ser Gly Trp Ser Phe Met Leu Arg lie Pro Glu Lys Lys Asn Met 

418 275 280 285 

421 Met Ser Ser Arg Gin Trp Gly Pro lie Phe Leu Lys Val Leu Pro Gly 

422 290 295 300 

425 Gly lie Leu Gin Met Tyr Tyr Glu Gin Gly Leu Glu Lys Pro Phe Lys 

426 305 310 315 320 

429 Glu lie Gin Leu Asp Pro Tyr Cys Arg Leu Ser Glu Pro Lys Val Glu 

430 325 330 335 

433 Asn Phe Ser Val Ala Gly Lys He His Thr Val Lys lie Glu His Val 

434 340 345 350 

4 37 Ser Tyr Thr Glu Lys Arg Lys His Ser Lys Thr Glu Val Val His G'i u 
4 3 8 3 55 36 0 36 5 

441 Pro Asp He Glu Gin Met Leu Lys Leu Gly Ser Thr Ser Tyr His Asp 

442 3 70 37 5 3KQ 

445 Phe Leu Asp Phe Leu Thr Thr Val Glu Glu Glu Leu Met Lys Leu Pre 
44 6 38 5 390 39 5 4oC 

449 Ala Val Ser Lys Pro Lys Lys Asn Tyr Glu Glu Gin Glu Tie Ser Leu 

450 405 410 41b 

4 53 Glu He Val Asp Asn Phe Trp Gly Lys Val Thr Lys Glu Gly Lys Phe 

4 54 42 0 425 4 30 

457 Val Glu Ser Ala Val He Thr Gin He Tyr Cys Leu Cys Phe Val Asn 
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VERIFICATION SUMMARY DATE: 06/21/2002 

PATENT APPLICATION: US/09/7 0 3 , 809A TIME: 11:25:19 

Input Set : A:\119941-1083 Seq ID Listing.ST2 5.txt 
Output Set: N:\CRF3\06212002\I703809A.raw 
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